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[57] ABSTRACT 

A method and an apparatus for the rapid learning of 
nonlinear mappings and topological transformations 
using a dynamically reconfigurable artificial neural 
network is presented. This fully-recurrent Adaptive 
Neuron Model (ANM) network has been applied to the 
highly degenerative inverse kinematics problem in ro- 
botics, and its performance evaluation is bench-marked. 
Once trained, the resulting neuromorphic architecture 
was implemented in custom analog neural network 
hardware and the parameters capturing the functional 
transformation downloaded onto the system. This 
neuroprocessor, capable of 10 9 ops/sec, was interfaced 
directly to a three degree of freedom Heathkit robotic 
manipulator. Calculation of the hardware feed-forward 
pass for this mapping was benchmarked at ~ 10 jisec. 
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ADAPTIVE NEURON MODEL — AN 
ARCHITECTURE FOR THE RAPID LEARNING OF 
NONLINEAR TOPOLOGICAL 

TRANSFORMATIONS 5 

ORIGIN OF THE INVENTION 

The invention described herein was made in the per- 
formance of work under a NASA contract, and is sub- 
ject to the provisions of Public Law 96-517 (35 USC 10 
202) in which the contractor has elected not to retain 
title. 

BACKGROUND OF THE INVENTION 

1- Technical Field 15 

The invention relates to fully recurrent analog neural 
networks and in particular to apparatus and methods of 
training such networks by continuous updating of syn- 
apse weights and neuron temperatures. 

2. Background Art 20 

A problem that has remained consistently at the fore- 
front of neural network research for the past several 
years, concerns the scalability of both neural networks 
architectures and their associated learning algorithms. 
Novel ideas for fast learning algorithms have surfaced 25 
in the literature, and include such formalisms as net- 
works with non-Lipschitzian dynamics; terminal attrac- 
tors; and locally-tuned processing units. There have 
also been significant developments in the area of dy- 
namically reconfigurable neural network topologies for 30 
optimal use of processing assets. These include the re- 
source allocating neural network (RANN) and the cas- 
cade correlation neural network (CCNN). However, 
the concern still remains that these approaches are 
geared toward ‘small’ problems that can be relatively 35 
easily implemented and trained in software within finite 
time on sequential digital computers. However, com- 
plex network architectures, which can model more 
realistic problems, often require specialized parallel 
hardware solutions as they prove to be otherwise fre- 40 
quently intractable computationally. Although such 
parallel hardware solutions are just beginning to emerge 
commercially, the problem of fast parallel on-chip or 
off-chip learning is still an issue. The fundamental prob- 
lem in learning, therefore, is to further develop, refine 45 
and speed-up present learning algorithms which are 
capable of either extracting or memorizing the func- 
tional relationship linking the input-output data pairs of 
observations in a more rapid fashion. Of course the 
desire is to consequently use this information to predict 50 
the correct response to novel input patterns in real time. 

In this specification, our initial focus is to develop the 
necessary mathematical formalism for a new connec- 
tionist learning architecture called the Adaptive Neu- 
ron Model (ANM) which has been designed to rapidly 55 
learn arbitrary, complex, nonlinear transformations 
from example. As the name of the model implies, the 
architecture allows both the synaptic and neuronal pa- 
rameters to adapt. Training statistics indicate a consid- 
erable gain in training time is achievable. This model 60 
has been applied to a broad class of problems and has 
been shown to achieve functional synthesis on the train- 
ing data. Of particular relevance to applications of the 
ANM model, is the class of problems commonly known 
as inverse mapping problems. These inverse problems 65 
are typically nonlinear, and are usually characterized by 
their one-to-many mapping operation. In other words, 
the specification of a goal does not uniquely determine 
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the action that must be carried out to meet the task at 
hand. Such systems are said to be redundant or degener- 
ate. This means that there can exist numerous if not an 
infinite number of distinct solutions for the system vari- 
ables, whilst at the same time being entirely consistent 
with the desired task at hand. 

Specifically, a problem that fits directly into this 
mold and has received much attention from the neural 
network community is that of the inverse kinematic 
problem for a robotic manipulator with excess degrees 
of freedom. It should be observed that biological sys- 
tems handle such ill-defined or ill-structured problems 
such as sensorimotor control with remarkable ease and 
flexibility and reveal a spontaneous emergent ability 
that enables them to adapt their structure and function. 
This provides the motivation for recourse to biologi- 
cally inspired paradigms for such problems. The inverse 
kinematic problem has been selected to benchmark the 
performance of the ANM learning algorithm. The in- 
ventor has previously addressed this same problem 
from the perspective of a feed-forward network em- 
ploying the backpropagation algorithm, as disclosed in 
U.S. patent application Ser. No. 07/473,024 filed Jan. 
31, 1990 by the present inventor and entitled “Neural 
Network with Dynamically Adaptable Neurons.” The 
use of connectionist architectures is not limited to the 
inverse kinematic problem only. In fact neural networks 
have been used for (a) unsupervised adaptive visual- 
motor coordination of a multijoint arm where the sys- 
tem can learn sensory-motor control from experience, 
(b) dynamic control of manipulators based on the 
CMAC approach of Albus, and (c) the neural learning 
of the mapping transformation between desired trajec- 
tory formation and the corresponding actuator move- 
ment. 

In selecting a neural network architecture, thought 
must also be given to the method for lifting the degener- 
acy problem present in these inverse mappings. Two 
distinct ways exist for achieving this. It can be accom- 
plished with the introduction of constraints embedded a 
priori within the training set. This is the method which 
we pursue in this specification when forming the train- 
ing set for the 3-link robotic manipulator. It may also be 
accomplished with the introduction of a penalty term 
within the energy function. The neural network is then 
trained to provide a solution that optimizes the penalty 
function p(x) that meets the specified constraints. 

In order to illustrate these points, we borrow an ex- 
ample from linear algebra. Consider the problem of an 
under constrained system of equations of the unknown 
variable x of the form Ax— b. Here, b is a constant 
vector quantity, and A is the matrix of constant coeffici- 
ents. For an under-constrained system of equations, 
there is no possible way for determining a unique solu- 
tion to x and in fact there exist an infinite number of 
solutions for it. Unique solutions can only be obtained 
by the introduction of an adequate number of con- 
straints equation in the variable x so as to properly 
constrain the system. 

In the last section of this specification, we report on 
an electronic implementation of the resulting neural 
network architecture that was put together from cus- 
tom analog ‘building block’ neural network chips devel- 
oped at the Jet Propulsion Laboratory. This electronic 
neuroprocessor has been interfaced with a commer- 
cially available Heathkit five degree of freedom robotic 
arm. In this example, the arm was constrained to motion 
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in the vertical plane using three degrees of freedom out 
of a total of five available degrees. The electronic 
neuroprocessor was shown to be capable of guiding the 
manipulator along arbitrary trajectories in real-time. 

SUMMARY OF THE INVENTION 1 

The invention is a method an apparatus for the rapid 
learning of non-linear mappings and topological trans- 
formations using a dynamically reconfigurable artificial 
neural network is presented. This fully-recurrent 10 
Adaptive Neuron Model (ANM) network has been 
applied to the highly degenerate inverse kinematics 
problem in robotics, and its performance evaluation is 
benchmarked. Once trained, the resulting neuro- 
morphic architecture was implemented in custom ana- 15 
log neural network hardware and the parameters cap- 
turing the functional transformation downloaded onto 
the system. This neuroprocessor, capable of 10 9 ops/- 
sec, was interfaced directly to a three degree of freedom 
Heathkit robotic manipulator. Calculation of the hard- 20 
ware feed-forward pass for this mapping was bench- 
marked at ~ 10 jisec. 

The invention trains an analog neural network of 
plural neurons connected together by respective synap- 
ses, the neurons having respective activity states and 25 
adjustable neuron temperatures, the synapses having 
adjustable synapse weights, by defining, relative to an 
error between activity states of output ones of the neu- 
rons and a predetermined training vector, predeter- 
mined time-dependent behaviors of: (a) the activity 30 
states, (b) the neuron temperatures in accordance with a 
gradient descent of the error in temperature space and 
(c) the synapse weights in accordance with a gradient 
descent of the error in weight space. The behaviors are 
governed by (a) an activity state relaxation time, (b) a 35 
neuron temperature relaxation time and (c) a synapse 
weight relaxation time, respectively. The invention 
continuously updates the neuron activity states, the 
neuron temperatures and the synapse weights of the 
analog neural network at respective rates correspond- 40 
mg to the relaxation times until the error is reduced 
below a predetermined threshold. The activity state 
relaxation time is shorter than the neuron temperature 
relaxation time and shorter than the synapse weight 
relaxation time, whereby for a given set of values of the 45 
neuron temperatures and synapse weights, the neuron 
activity states equilibrate before changes in the set of 
values occur. In the preferred embodiment of the inven- 
tion, each behavior is a corresponding differential equa- 
tion for each neuron. The behavior of the activity states 50 
is an activity state differential equation for each neuron 
defining the time derivative of the corresponding activ- 
ity state in terms of a product of the corresponding 
neuron temperature multiplied by the sum of products 
of corresponding ones of the synapse weights and cor- 55 
responding ones of the activity states. The continuously 
updating includes solving the activity state differential 
equation for the activity state repetitively at a rate cor- 
responding to the activity state relaxation time. The 
behavior of the neuron temperatures is a temperature 60 
differential equation for each neuron defining the time 
derivative of the corresponding neuron temperature as 
a product of an exponential function of the neuron tem- 
perature and the error transformed by a matrix of the 
synapse weights. The continuously updating further 65 
includes solving the temperature differential equation 
for the neuron temperature repetitively at a rate corre- 
sponding to the temperature relaxation time. The be- 


4 

havior of the synapse weights is a weight differential 
equation for each synapse defining the time derivative 
of the corresponding synapse weight as a product of an 
exponential function of the corresponding neuron tem- 
perature and a sum of products of the synapse weights. 
The continuously updating further includes solving the 
weight differential equation for the synapse weight 
repetitively at a rate corresponding to the weight relax- 
ation time. The differential equations are solved in an 
order of ascending relaxation times. The invention em- 
ploys the results of the solving of a previous differential 
equation in solving a next differential equation. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates the network topology and connec- 
tivity of a fully recurrent neural network included in the 
invention. 

FIG. 2 illustrates a robotic arm employed in carrying 
out one application of the invention. 

FIG. 3 illustrates the degeneracy problem for a robot 
manipulator solved by the invention. 

FIG. 4 illustrates the smoothness criteria for a trans- 
formation employed in robotic kinematics emulated by 
the invention. 

FIG. 5 illustrates the reductions of a robotic system 
to a 3 link kinematic chain emulated by the invention. 

FIG. 6 illustrates a cartesian end effector space se- 
lected for training the network in accordance with the 
invention. 

FIG. 7 illustrates the joint space spanned by three 
joint angles emulated by the invention. 

FIG. 8 is a graph illustrating the training statistics of 
a neural network embodying the invention. 

FIG. 9 is a graph illustrating the performance of a 
neural network embodying the invention on a training 
set. 

FIG. 10 is a graph illustrating the performance of a 
neural network embodying the invention on untrained 
examples. 

FIG. 11 illustrates planned versus actual paths using a 
trajectory generation method in carrying out the inven- 
tion. 

FIG. 12 is a block diagram of a feed forward neural 
network hardware implementation embodying the in- 
vention. 

FIG. 13 is a schematic diagram of a robotic controller 
emulated by the invention. 

FIG. 14 is a graph illustrating end effector position- 
ing accuracy as a function of weight quantization in 
carrying out the invention. 

FIG. 15 is a flow diagram illustrating the algorithm 
for continuously updating neuron temperatures and 
synapse weights in accordance with the invention. 

DETAILED DESCRIPTION OF THE 
INVENTION 

Adaptive Neuron Model 

The basic premise behind the Adaptive Neuron 
Model is that the neurons should be allowed to actively 
participate in the learning process. Prior neuromorphic 
models regarded the neuron as a strictly passive nonlin- 
ear element and the synapse as the primary source of 
knowledge capture and information retention. In this 
section, we develop the necessary mathematical formal- 
ism for the Adaptive Neuron Model. This is a fully 
recurrent extension of earlier work done by the inven- 
tor described in the above-referenced patent applica- 
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tion. The dynamics of a neural network consisting of a 
system of N interacting neurons is specified by the set of 
N coupled nonlinear differential equations. These are 
given by: 


dxi ( f N 1\ 

— - + gi | J 2 1 W'jXj + // J J 


where x/ represents the activity of the \ th neuron in a 10 
system of N interacting neurons and is bound on the 
interval [0,1]; where w,y is the inter-neuron coupling 
coefficient (or connection strength) and is direction 
sensitive in the sense that w,y is directed from the ] th 
neuron to the \ th neuron; where the constant 1/ repre- 15 
sents the external stimulus received by the \ th neuron; 
where Pi is the inverse of the temperature of the \ th 
neuron, and is a local parameter expressed as )3/=T/ -1 . 
The function g*( ) is a nonlinear function, and the only 
restriction in choosing this function is that it be differen- 20 
tiable. There are several popular choices for g ; ( ), how- 
ever two commonly used forms include the logistic 
function and the gaussian local response function. These 
are given by: 

25 


(1 + logistic function 

ae — y«2 gaussian function 


space therefore uniquely specifies the state of the neu- 
rons. Fully-recurrent algorithms already exist that will 
adjust the synaptic weight matrix W so that a given 
initial state x*, and a given input vector I result in a fixed 
point xf whose components have the desired set of tar- 
get values tj j along the output units. This is accom- 
plished by minimizing a function E (the so-called en- 
ergy function) which measures the distance between the 
desired fixed point and the actual fixed point over the 
entire training set. For M output units, this function is 
given by: 


E = \ 2 0?/ - xi) 2 


The function E depends on^ the weight matrix W 
through the fixed points, i.e. x^(w,y). A learning algo- 
rithm must therefore be capable of driving the fixed 
points towards the manifolds which satisfy x/=tj/ on the 
output units. This is accomplished dynamically by al- 
lowing the system to evolve in the weight space along 
trajectories which are antiparallel to the gradient of E. 
In other words, 

dyvrs _ bE ( 4 ) 

T * dt “ zw n 

where r w is a numerical constant which defines the 


and where a, /3, and y are constants that modify the 
shape of these nonlinear response functions. In the fol- 
lowing derivation, we shall make reference to the popu- 
lar logistic function to illustrate the formalism. Exten- 
sion to other response functions is direct. The state of 35 
this dynamical system can be uniquely defined in the 
following manner. For a system of N interacting neu- 
rons, the collective activity of the system i.e., (x*, all i 
e{l,N}) defines a unique point, and hence state, in an N 
dimensional unit cube. The unit cube restriction comes 40 
from the boundary conditions imposed on the variables 
Xi in Equation 1. For the sake of the argument, if we 
consider the adiabatic response of the system, i.e. where 
both the synaptic weights and the neuronal tempera- 
tures remain quasi-static, then the temporal evolution of 45 
the system may then be viewed as a continuous path in 
this N-dimensional unit cube of activity. The dissipative 
nature of the differential equation ensures that for a 
given set of initial conditions, as given by the vector x f , 
external stimulus I and the weights matrix W, the sys- 50 
tern will ultimately relax into a final state which we 
denote by xf. In the following derivation, we use the 
convention where vectors, vector components and ma- 
trix components are denoted by lower case letters with 
the appropriate number of indices, and matrices are 55 
correspondingly denoted by upper case letters. During 
training, two other spaces are of relevance. First, there 


relatively slow time scale for solving these differential 
equations for the weights. The reason for r w to be long 
arises from the requirement that the dynamical system 
given by Equation 1 must always be at a steady state. In 
other words, if we let the numerical constant r x defines 
the time scale on which the dynamical system relaxes, 
then t x < <t w , and implies that the system of Equations 
1 relax on a time scale where it would appear that the 
weights are static, i.e. stationary. In the following sec- 
tion, a similar formalism is derived for the neuronal 
parameters. 

0.1 Neurodynamical formalism 
In this section, the general overview described above 
is formalized into a mathematical framework which 
fuses together into one fully recurrent formalism, both 
synaptic weight matrix learning (as proposed and devel- 
oped by Pineda and neuronal temperature learning. 
Therefore, in addition to the system of differential equa- 
tions given by Equations 1 and 4, our goal is to find a 
local algorithm which adjusts the temperature vector 
of the neurons so that for a given initial state x f , external 
stimulus i, and weight matrix W, the system relaxes into 
the desired point attractor structure. The specific 
method selected by the inventor for accomplishing this 
task is to allow the system to evolve in the temperature 
space along trajectories which are anti-parallel to the 
gradient of the error function E given in Equation 3. In 
other words. 


is the N X N dimensional space covering the real valued 

synaptic elements (w,y, all i,j e{l,N}). An NxN-dimen- _ d$ r __ _ a£ (5) 

sional point in this space uniquely defines the weight 60 7 & ~ dt ~ ip r 

state of the system. Furthermore, there is now an addi- 
tional N-dimensional positive real valued space (fit, all i where rp is a numerical constant that defines the time 

e{l,N}) spanning all possible combinations of the neu- scale on which the neuronal temperatures changes, 

ronal temperature parameters. Specific considerations Again, the time scale selected for rp is relatively long, 

on physically meaningful or allowable values for neuro- 65 i.e. t x < <rp. The reasons for this inequality is similar to 
nal parameters have to be made. For the logistic func- that described above for the weight updates. During the 

tion in Equation 1, it makes no sense considering nega- relaxation of the dynamical system into its attractor 

tive 0i temperatures. A point in this N dimensional structure, (in accordance with Equation 1), all parame- 
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ters including the synaptic weights and neuronal tem- 
peratures must be static. Therefore, the weights and 
temperature update time scales can be chosen to have 
similar time scales, i.e. t^t w . Consider again the sys- 
tem of N interacting neurons. Of the various neurons in 5 
this network, there will exist three subsets: an input set, 
an output set, and a hidden set of neurons. The hidden 
set of neurons are those that are not connected to the 
outside world. A representation of this neural network 
topology is given in FIG. 1. 10 

As was stated earlier, the dynamics of the system of N 
interacting neurons is determined by Equation 6. 


Mij— dij— fig' i Wy 

Then Equation 12 can be rewritten as 


N a xf 

i Mij ~~ 8irgf(u/)ui 


03) 


04) 


But since we want to solve for the partial derivative of 
x/ with respect to the temperature parameter £ r , the 
matrix M in equation (14) must be inverted 


“5T = -*<' + « (ft jf , + 7 ' ]) 


(6) 15 1 JT = 


(15) 


Let us now find the local algorithm which will adjust 
the temperature vector % It was stated earlier that one 
way of accomplishing this was to let the neuron temper- 20 
atures evolve in temperature space along trajectories 
which are anti-parallel to the gradient of the error func- 
tion given in Equation 3. In other words, 


Substituting Equation 15 into Equation 8 results in the 
following expression 


dfir r * N 1 1 

r/3— = JfM~ X )jr 


(16) 


r bE 

■ T & dt d(2 r 


(7) 
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Let us define the vector quantity y to be a vector quan- 
tity whose component is equal to the expression con- 
tained within the brackets in Equation 16, i.e. 


This choice of gradient descent has the simplest dynam- 
ics which minimizes E. On performing the differentia- 
tion in Equation 7 one immediately obtains 


y/= gr'iu/) 


(17) 


dfir N dxf 

T *— - .s, (*/-*/> 


( 8 ) Then the differential equations governing the rate of 
change of the temperature fir can be expressed as 


35 


The partial derivative of x/ with respect to fi r is ob- 
tained by noting that the fixed points of Equation 6 
satisfy the nonlinear algebraic equation 


Mr f f 


(18) 


xf = Si 2 ^ Wijxj + /,* 


(9) 


The problem is that in Equation 15, the calculation of 
40 the matrix inverse is anon-local operation. Pineda 
showed a methodology for working around this prob- 
lem. Consider again Equation 17, and regroup the 
terms, i.e. 


Taking the partial derivative of this equation with re- 
spect to fi r results in the following expression, where the 45 
right hand term is simply the result of the chain rule, 




(19) 


dxf 

IS 


r " "W+ *)l ! 7 + A C»1 )) 


( 10 ) 


50 


Equation 19 can be rearranged into the following ex- 
pression 


But the partial differential on the left hand side of Equa- 
tion 10 can be expressed as a summation over all neu- 
rons by using the Kronecker delta function, 


55 


N ( yif \ 

•-aMi ^r) +Jr 


( 20 ) 


dxf N dxf 

= j~i 6ij ~ 


(ii) 


Let us further define the vector quantity z to be the 
quantity in brackets in Equation 20. 


where dij— 1 iff i=j. Substituting Equation 1 1 into equa- 60 
tion 10 and regrouping terms, z ff 


.('jL'I 

\gk'(U[f) ) 


( 21 ) 


N dxf r N 1 

2 ^ [8,y — fijgj w 0 ] — Si &ir I 2 ^ (.WijXj + //) I 


( 12 ) 


65 


But Equation 21 is a stable solution of the following 
differential equation 


Let us define the matrix M whose components are given 
by 


dz r N 

dt = ~ fcil MrkZ k + Jr 


( 22 ) 
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which can be easily solved. This leads to the following 
expression where we introduce timescales: 



and formulates all the necessary neuronal learning equa- 
tions. For the system to learn, it is necessary for the time 
scales to be properly selected. The dynamical equations 1° 
with explicit time scales are given as follows: 


dxi ( P N n\ 

~~[f~ - ~ x i + Si 2^ WjjXj + Ji J J 


( 24 ) 


15 


#r r f 


dl r 

dt 


N 

= - 2 M r fcZk + J r 

k - 1 


dw rs , f 
T »— = Vffx/ 

dv i 


ll- = -y r + .2^ W ir Vi + J r ^ 


(25) 

(26) 2Q 

(27) 

(28) 25 


The relaxation time scale for the forward propagation is 

30 

Equations 27 and 28 are taken from prior techniques 
based upon updating the synapse weights only. 

The relaxation timescale for the backward propaga- 
tion is T v . The weight adaptation time scale is given by 
t*. The adaptation time scale for the neuronal parame- 35 
ters is T£. It is straightforward to establish the relation- 
ships which must be satisfied by the characteristic time 
scales of the system. Recall that an a-priori requirement 
is that the system parameters w and I change adiabati- 
cally. This condition alone constrains the relative time 40 
scales. 

WORKING EXAMPLE 

A robotic manipulator may be thought of as a set of 
interconnected chain elements called links. Links are 45 
connected to one other via joints, of which one such 
joint is the re volute joint. Joints are characterized by 
their having a single degree of freedom. Therefore, for 
an n joint manipulator, the static state of system is en- 
tirely specified by an n-element vector, whose compo- 50 
nents provide a measure of the relative deflection of the 
two connecting linkages. Kinematics is the science of 
motion that deals with the geometrical description of 
these linkages. It is not to be confused with dynamics in 
that no reference is made to the forces and torques 55 
required to cause these joint to move. Kinematics there- 
fore involves coordinate transformations between 
frames attached to the joints, whose variables include 
joint positions, velocities, accelerations, and all higher 
temporal and spatial derivatives of the joint variables. 60 
For an n degree of freedom manipulator, the n dimen- 
sional space swept by all possible joint vectors is de- 
fined as the joint space. The direct kinematics problem 
requires the computation of the Cartesian state descrip- 
tion of the end-effector from knowledge of the joint 65 
state description. The direct kinematics calculation for a 
manipulator is a one-to-one transformation and conse- 
quently the major expense in calculating kinematics is 
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often the repeated calculation of the transcendental 
functions. 

0.2 Inverse Kinematics 

A fundamental motion control task in the hierarchy 
of robotic tasks necessary to operate a robotic manipu- 
lator is the inverse kinematics problem. It is based on a 
non-linear inverse mapping that involves the geometri- 
cal structure of the robot manipulator. Let us consider a 
robotic arm whose base is centered at the origin of some 
Cartesian frame of reference. In this specification mo- 
tion is restricted to 2-D. 

Any point in this space is completely specified by 
three orthogonal translations from the origin. Conse- 
quently, a rigid body requires at least three coordinates 
to specify its position and orientation relative to this 
origin. The inverse kinematics problem is the transfor- 
mation between the Cartesian coordinate system and 
the coordinate system particular to the robot manipula- 
tor, i.e. the joint space. In other words, if we specify a 
Cartesian location for the robot’s end-effector, we must 
determine the set of joint angles required by the robot to 
accomplish this task. The importance of this problem 
becomes clear when one considers that for the end- 
effector to achieve a desired trajectory in Cartesian 
space, one must determine how the joint angles vary 
with time along the entire trajectory. The transforma- 
tion between Cartesian space and the joint space de- 
pends on the manipulator’s geometry, and most geome- 
tries lack a closed form solution to the inverse kinemat- 
ics problem. Typically, an iterative search method must 
be implemented to perform this transformation. Fur- 
thermore, an added complication arises when redun- 
dant manipulators can reach the same end-effector posi- 
tion and orientation via many possible configurations. 
This poses the additional problem of selecting the opti- 
mal set of joint angles when a trajectory passes through 
a region of multiple solutions. 

In this specification we have considered a 3-degree of 
freedom manipulator moving in a 2-dimensional space, 
and have targeted both the inverse kinematics problem 
and the closely associated problem of trajectory genera- 
tion as representative tasks for a connectionist approach 
and for a hardware based neural implementation. FIG. 
3 shows the degenerate 3-link robotic arm and FIG. 2 
shows the arm and its workspace envelope. It is cen- 
tered at (0,0), where its links are constrained to move in 
the x-y plane. Points within this planar workspace 
bound by the first and fourth quadrant of the x-y plane, 
constitute valid Cartesian coordinates for the end-effec- 
tor. As stated above, the inverse kinematics problem 
deals with the transformation from the 2-dimensional 
Cartesian space (x,y) describing the manipulators envi- 
ronment to the 3-dimensional joint space (01,02,03) de- 
scribing the posture of the robotic manipulator. Points 
within the workspace, (x,y) can be reached by the arm 
only by specifying the associated set of joint angles. The 
transformation f: (x,y)— >(01,02,03) defines the inverse 
kinematics problem. 

0.3 Training Set Formation 

Within the neuromorphic framework, the solution of 
the inverse kinematic problem involves two phases. 
These are the training phase and the recall phase. It is 
the purpose of the training phase to encode within the 
network’s synaptic weights and neuronal parameters 
the inverse mapping. Great care must be taken to gener- 
ate an appropriate set of training samples that capture 
the essential characteristics of the mapping from the 
task space of the end-effector to its joint space. The 
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need for a careful selection of the training set becomes 
clear for the following reason. Consider a point well 
within the planar workspace. Since the problem for the 
three link uni-planar arm is degenerate, there is an infin- 
ity of valid arm configurations for each such training 5 
point we select in the workspace. Continuity condi- 
tions, however, require that in the absence of external 
conditions (such as obstacles) that the non-linear map- 
ping operation should reflect the nearness of Cartesian 
points (in task space) to a corresponding nearness in 10 
joint angles (in joint space). This functional nearness is 
shown in FIG. 4. If the training set is selected with this 
proviso in mind, then what the neural network captures 
is a smooth transformation linking the two spaces. This 
method allows us to imbed the constraints peculiar to 15 
the system via an appropriate choice of a training set 
and at the same time a priori lift the degeneracy inher- 
ent in the problem. 

There are numerous methods for generating a train- 
ing set consistent with the smoothness criteria which at 2 q 
the same time will lift the degeneracy. The method used 
in this specification is borrowed from the statics and 
dynamics of mechanical structures. It is based upon the 
observation that a catenary chain assumes a unique 
configuration in the presence of the gravitational field. 25 
A catenary chain is a continuous cable characterized by 
amass-per-unit-length term. For arbitrary positions of 
the two ends of the chain, the system always assumes a 
unique equilibrium configuration. In the discrete ap- 
proximation to the chain, we consider the 3-link robotic 3Q 
arm as a 3-link chain. Each link having a length term 1/ 
and a mass term m, through which it couples with the 
external gravitational field g. Furthermore, to make the 
semblance to a robotic system more realistic, one end of 
the chain is attached to the origin of our system, while 
the other end is allowed to move freely about the first J 
and fourth quadrants defining the workspace. The 3- 
link system is shown in FIG. 5. Each element is assigned 
both a length term 1/and a mass term m /. Coupling of the 
chain mass to the gravitational field lifts the degeneracy 
by uniquely specifying an arm configuration consistent 40 
with the extremum of the energy of the system. The 
above discussion allows us to formulate the problem. 
Referring to FIG. 5, the potential energy of the system 
composed of the three links is given by: 

45 

V( 6 1 ,02*03) = Cl sin 0 1 + C2 sin 02 +03 sin 03 (29) 

where the constants c/are functions of the link mass and 
length terms, and are given by: 


Cl = < ?/i(0.5mi-t-m2 + m3) 

(30) 

C2=g/2(0.5m2+m3) 

(31) 

C3=g/3(0.5m 3 ) 

(32) 


This system is subject to physical constraints which 
describe that the sum of the spatial Cartesian compo- 
nents must equal the end-effector position, i.e., 

4>l(01,02,03) = /l cos 0i-t-/2COS 02 + /3COS 03— (33)^ 

<J>2(0 1 >02*03) ~h sin 0 1 + /2 sin 02 + /3 sin 03 —y (34) 

We must now determine the extremum of the function 
V (0i, 02,03). Because of the nonlinearity of the equa- 65 
tions involved, substitution of the constraint equations 
into V (so as to reduce V to a function of a single inde- 
pendent variable) is not possible. The use of Lagrangian 
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multipliers is therefore required. Accordingly, we con- 
sider the augmented function given by 


V~V+Mfa + \ 2 4>2 

(35) 

For the function V to attain an extremum, 
and sufficient condition requires that 

a necessary 

dV v * *4* n 

— + k li k ~ = 

(36) 

This leads to the set of five simultaneous equations in 
five unknowns, i.e. the three joint variables and the two 
Lagrange multipliers. 

ci cos 0i~Xi/i sin 0[+X2h cos 01— 0 

(37) 

C2 cos 02— Xi/2Sin 02+^2 cos 

(38) 

c 3 cos 0 3 — Xi/ 3 sin 03+X2/3 cos 03=0 

(39) 

/l cos 0i 4-/2 cos 02+h cos 03— x=0 

(40) 

l\ sin 01 + /2 sin 02+/3 sin 03— y=0 

(41) 


The Newton-Raphson iteration technique was utilized 
to find the roots of this system of nonlinear equations. A 
minor problem with this approach is that good starting 
values are required for the algorithm to converge. The 
technique we employed was to divide-up the reachable 
workspace into a cellular grid. A cell was selected for 
which the joint angles could easily be guessed/deter- 
mined, (i.e. extended arm along the x-axis), and the 
Newton-Raphson method was used to determine the 
Lagrange multipliers for that cell. Because of the conti- 
nuity condition we imposed earlier, we expect that the 
determination of these angles and multipliers for a given 
cell provide good starting values for contiguous neigh- 
boring cells. In this way, we can obtain the angles nec- 
essary to form the training set. A significant subspace of 
the manipulator’s total reachable workspace was 
chosen over which to train the neural network. Thirty 
Cartesian end-effector positions uniformly spanning this 
subspace were selected and are shown in FIG. 6. These 
serve as the input elements of the training pairs. Thirty 
points uniformly span this space. Points selected were 
chosen to lie on a polar grid. Both the abscissa and 
ordinate are normalized on [0,1]. The output space, 
spanned by (01,02,03), corresponding to the thirty 
points in FIG. 6 are shown in FIG. 7. It is important to 
note that the space spanned by the Cartesian coordi- 
nates maps onto a smooth manifold in joint space. This 
guarantees that the continuity condition is met and that 
the neural network will be capable of generalizing be- 
yond the training set. FIG. 7 illustrates the joint space 
spanned by the three joint angles (01,02,03)* These thirty 
points correspond to the transformation of the corre- 
sponding points in Cartesian (x,y) pairs in the previous 
figure. 

0.4 Network Architecture 

Once the training set is selected, the next task is to 
apply the Adaptive Neuron Model formalism to the 
training data so as to capture the embedded inverse 
transformation of the 3-link inverse kinematic chain. 
The network is arranged into three sets of non-linear 
processing units; an input set, a hidden set, and an out- 
put set. Continuously variable synaptic weights w,y 
connect units from layer to layer within the network 
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and positive real valued temperatures /3/ characterize 
the neurons. Since the inverse kinematic task for the 
3-link arm involves a mapping from R 2 — ►R 3 , the input 
set consists of two input units representing the actual 
Cartesian (x,y) location of the end-effector. There are 5 
ten units on the hidden set. The output set consists of 
three units representing the desired joint angles. Since 
sigmoidal activation functions were chosen to have a 
dynamic range of [0,1], both the Cartesian coordinates 
and the joint angles used in the training set were prop- 10 
erly normalized. The results of training the neural net- 
work on the inverse kinematic problem are summarized 
in FIG. 8. The number of training iterations required to 
learn the problem to a given precision is shown plotted. 
Prior to training, the synaptic weights and neuronal 15 
parameters were randomized. As can be seen from FIG. 

8, the error decreases fairly rapidly after only a few 
hundred iterations. After training, the network was first 
tested on elements belonging to the training set. The 
network was prompted with uniformly distributed Car- 20 
tesian end-effector coordinates taken from the training 
set and the three joint angle values, as computed by the 
neural network, were compared with the correspond- 
ing target angles. These results are shown in FIG. 9 for 
nine desired end-effector Cartesian points uniformly 25 
covering a subspace of the total available workspace. 
The desired or expected placement of the end-effector 
are shown as open circles. The actual end-effector 
placement as given by the neurally calculated joint 
coordinates are shown as solid black circles. As can be 30 
seen, the solid circles overlap the open circles to a great 
degree of precision. Performance over the remaining 
available workspace was identical. Since we require 
that the network be able to generalize beyond the train- 
ing set and therefore learn the inverse kinematic trans- 35 
formation as opposed to memorizing the training set, 
the network was tested over uniformly distributed 
points covering the workspace that were not elements 
of the training set. These results are summarized in FIG. 

10 . As before, 20 points from within the same work- 40 
space subspace as tested in the previous figure were 
selected for clarity. Performance over the remaining 
region was identical. Again, desired end-effector place- 
ment are shown as open circles whereas actual end- 
effector placement are shown as solid circles. In this 45 
figure one observes some very minor discrepancies 
between the registering of the open and solid circles. 
However, it is evident that the neural network has cap- 
tured the underlying nonlinear inverse kinematic trans- 
formation, and that the network is performing some 50 
form of interpolation approximation for points that are 
not elements of the training set. For completeness, the 
network was prompted on points outside the training 
workspace boundary. In this instance, the positioning 
accuracy of the end-effector degraded gracefully but 55 
rapidly as one moved further and further from the 
boundary enveloping the training region. This indicates 
a form of limited extrapolation capability on the part of 
the network. 

0.5 Trajectory Generation Problem 
The trajectory generation problem is a higher order 
task in motion control hierarchy, and relies in part upon 
the solution of the inverse kinematics. The role of the 
trajectory planner in a robotic system is to determine 
the entire path of the arm from an initial position to 
some final position. Conventional approaches involve 
specifying a finite set of states of the manipulator along 
the desired trajectory. A state is a time independent 
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description of the manipulator relative to the origin of 
the frame of reference. This includes the location and 
orientation of each joint on the arm. The role of the 
trajectory planner, therefore, is to complete the specifi- 
cation of the path by smoothly linking these states while 
at all times not violating constraints imposed by either 
the existence of obstacles in the workspace or un-physi- 
cal arm configurations. Let us consider the motion of 
the end-effector. Again, we restrict the manipulator to 
motion in the plane. A trajectory specifies the path the 
end-effector is to follow in moving from an initial point, 
(x/,y/), to some final (x/,y/) point. A direct method of 
achieving this is by smoothly linking the trajectory’s 
end-points by a path specified by some algebraic expres- 
sion. If there are no constraints in the path, a straight 
line linking the two end points suffices. In the presence 
of obstacles, a special path described by some smooth 
polynomial function might be required. The robotic 
manipulator would be moved in the following manner. 
Starting from some initial position, the Cartesian coor- 
dinates would be incremented along the selected path 
and the inverse kinematic transformation would then be 
used to determine the corresponding joint angles. The 
joint actuators would then take care of the actual physi- 
cal rotation/translation of the various links on the ma- 
nipulator. This method relies upon repetitively solving 
the inverse kinematic equations along the whole path. 
Since these equations are very computationally expen- 
sive to solve by conventional approaches, the above 
method is not usually used. Conventional approaches 
rely on selectively solving the inverse kinematic equa- 
tions at both end points of the trajectory as well as a few 
via points on the trajectory. This constraint is non-exist- 
ent however in the neuromorphic approach outlined in 
the previous section, since there is no computational 
burden involved in obtaining solutions to the inverse 
kinematic equations in real time. Let the trajectory be 
specified by the function f, which is a function of the 
initial end-effector position and the time variable. This 
is given by 

(X r y) = ( 42 ) 

A trajectory is generated by incrementing the time 
variable t by an amount At. New x-y coordinates of the 
end-effector are obtained upon substitution in the above 
expression. These new cartesian coordinates are then 
fed to the neural network which is trained on the in- 
verse kinematic problem. In this fashion, we can gener- 
ate the incrementally changed joint angles. FIG. 11 
shows such a trajectory. The end-points for this trajec- 
tory are at (0.3, 0.2) and (0.5, 0.2). The trajectory that 
was selected to link these end-points was a sine wave. 
The figure compares the performance of the planned 
versus actual response of the neural network. The de- 
sired sine wave is shown as a solid line linking the end- 
points. The performance of the robotic manipulator at 
various points along the sine wave are shown as solid 
filled circles. The arm posture at these points is shown 
as a set of dotted linkages. There are very small discrep- 
ancies between the desired and actual end-effector posi- 
tioning. 

Hardware Implementation 

In this section, we describe the electronic implemen- 
tation of a feed-forward neuroprocessor using custom 
CMOS analog synaptic and neuronal chips developed 
at JPL. These chips were fabricated through MOSIS 
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with a 2 /im feature size. The synaptic chip is organized 
as a 32x32 crossbar configuration of synaptic cells. 
Each such synaptic cell has 7-bits of resolution, i.e. 128 
levels of quantization split up into two groups of 64 
levels — an excitatory and an inhibitory group. Techni- 5 
cal details of this chip may be found in A. Moopenn et 
al., “Digital Analog Hybrid Neural Network Simulator: 

A Design Aid for Custom VLSI Neurochips,” Proceed- 
ings of the SPIE Conference on High Speed Computing , 
1989, Los Angeles, Calif. The neuron chip is organized 10 
as a 1 X36 linear array of variable gain neurons. Using 
these neural network building block chips, a feed-for- 
ward neuroprocessor was implemented in hardware. A 
feed-forward structure was utilized rather than a fully 
recurrent structure for ease of electronic implementa- 15 
tion. This feed-forward structure consisted of 2 input 
units, two hidden layers of 5 units each, and an output 
layer of 3 units. FIG. 12 shows a block diagram of such 
an electronic network. It uses a single variable gain 
neuron chip wired up to a single synaptic chip to obtain 20 
the desired 2-S-5-3 layered neural network architecture. 
This configuration allows layered structures to be im- 
plemented by electronically carving out synaptic blocks 
of the appropriate dimension. Neuronal outputs from a 
given layer are fed back onto unused synaptic elements 25 
on the chip. In FIG. 12 , the input stage of the 2-S-5-3 
neuron architecture is shown on the upper left hand 
side. There are three inputs shown. The topmost input is 
the common input bias to neurons on all layers. Analog 
voltages representing the normalized x-y Cartesian co- 30 
ordinates are fed into the remaining two inputs. Neu- 
rons in the first hidden layer are shown in the lower left 
hand side of the chip, and connect with the input signals 
via the upper left hand 3x5 bank of synaptic cells. The 
output of the neurons folds back onto an unused 5x5 35 
bank of synaptic cells, shown in the central part of the 
chip. The output of these cells feed into the second 
layer of 5 hidden neurons. These neurons fold back in a 
similar manner onto an unused 5x3 bank of cells from 
which an output signal is derived after passing through 40 
the final layer of hidden neurons. The voltages appear- 
ing on the outputs of these three neurons represent the 
three scaled joint angles. It is these two inputs and three 
outputs that interface to the outside world. 

Overall, the hardware consisted of one synapse chip 45 
and one neuron chip and support circuitry wire- 
wrapped onto a VME-bus prototype card. This is 
shown schematically in FIG. 14 . The downloader cir- 
cuit comprised of a digital memory of weight values, a 
counter for accessing each memory location and each 50 
corresponding synapse in turn and some control logic 
for programming the selected synapse with its corre- 
sponding weight. A Compaq computer was pro- 
grammed with a master control routine, with which 
arbitrary feedforward network architectures could be 55 
mapped onto the hardware as described above. Boards 
with 8 channel search of 12-bit digital-to-analog and 
analog-to-digital converters were used to program and 
read network inputs and outputs, and a serial line with 
an efficient transfer protocol was used to program the 60 
network by way of a 6800-based VME bus controller. 
Consequently, architectures with up to 8 inputs and 
outputs, and 22 hidden units distributed among up to 8 
hidden layers, could be implemented. A VME-based 
RS-232 controller card acted as the necessary interface 65 
between the neuroprocessor and the microprocessor 
on-board the robotic arm. Since the computer simula- 
tions of the inverse kinematic problem made full use of 
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analog synaptic weight values, and since the synaptic 
hardware can only support limited quantization, the 
impact of finite synaptic weights on end-effector posi- 
tioning accuracy was assessed. On completion of train- 
ing, the synaptic weights w,y on all layers were observed 
to be bound. This range was given as — 4.5=iw,j= +4.5. 
In FIG. 14, the end-effector positioning accuracy is 
shown plotted as a function of the levels of quantization 
of the synaptic weights. This data was generated by 
discretizing the analog weights as a function of the n-bit 
weight precision, where n ranged from 1 to 12. The 
positioning error, defined by: 

i n , , . (43) 

Error = j ((* “ Xo)1 + O' " -Vo) 2 )* 

is an average over N Cartesian points of the euclidian 
distance between the end-effector position as given by 
analog synapses (x<>,y 0 ) and one as given by quantized 
synapses (x,y). The minimum resolution required is 
application sensitive. This could range from 4 to 12 bit 
resolution, depending on the end-effector positioning 
tolerances. In real world terms, an arm with a 1.0 m 
extension radius would reach it’s target within 1 cm for 
7 bit resolution, or within 0.6 mm for 12 bit resolution. 

Updating Algorithm 

Referring to FIG. 15 , the algorithm for updating the 
neuron temperatures and synpase weights during train- 
ing begins by solving Equation 28 for v r (block 10 of 
FIG. 15 ). Then, Equation 24 is solved for x/ (block 20 of 
FIG. 15 ). Next, Equation 26 is solved for z r (block 30 of 
FIG. 15 ). Then, Equations 25 and 27 are solved for ft 
and w & respectively. These equations are solved in 
ascending order of their time constants r, so that the 
quantities having the least relaxation times are solved 
for first and more frequently than quantities having 
greater relaxation times. The results of the solution of 
the previous equation are employed in solving the next 
equation. 

The time constants are selected to optimize a particu- 
lar problem being solved by the neural network. For 
example, if the problem is that of classification, then t@ 
is shorter than t^, while the reverse is true if the prob- 
lem is one of a continuous mapping relation. 

Conclusions 

In this specification, we have developed the Adaptive 
Neuron Model for learning in fully recurrent neural 
network architectures. The fundamental idea behind 
this model was to actively involve the neuron process- 
ing elements in the learning procedure. Prior ap- 
proaches have regarded the neurons as passive nonlin- 
ear processing elements and the synapses as the only 
source of information retention. In the ANM model, the 
functional nature of the neuron response function is 
non-critical for the mathematical developments. How- 
ever, the only requirements of the model is that the 
neuron response function be parametrized by a local 
parameter(s) (such as the temperature ft in the deriva- 
tion above) and be continuously differentiable in that 
parameter(s). This formalism has been shown to be 
extremely effective in reducing the training times on a 
variety of learning problems. Because of the ANM’s 
ability to rapidly learn complex nonlinear transforma- 
tions, it was trained on the inverse kinematic problem. 
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The algorithm was benchmarked on this supervised 
learning task and demonstrated an order of magnitude 
faster training time than an identical algorithm with the 
adaptive neuron feature disabled. Specifically, the 
ANM model was applied and trained from examples on 5 
the degenerate inverse kinematic problem in robotic 
control. The model was shown to rapidly capture the 
inverse transformation embedded within the training 
set. 

We have also implemented the feed forward neural 10 
network architecture for the inverse kinematic and 
trajectory generation problems in hardware. This con- 
sisted in interfacing custom VLSI synaptic chips with 
variable gain neuron chips developed at JPL so as to 
form a neuroprocessor. The synaptic chip is arranged in 15 
a 32 x32 cross-bar design and offers 7 bits of dynamic 
range or resolution. The simulated synaptic weights 
were down-loaded onto the hardware and the neuro- 
processor was used for the real time control of a manip- 
ulator. Results indicate that these problems can indeed 20 
be down-loaded from simulation to hardware at the cost 
of some loss in accuracy. For the robotic problem, there 
is a finite degradation in positioning accuracy over the 
simulation results. Improved positioning accuracy can 
be achieved by tweeking the synaptic weights once 25 
downloaded with chip-in-the-loop learning. 

While the invention has been described with specific 
reference to preferred embodiments thereof, it is under- 
stood that variations and modifications may be made 
without departing from the true spirit and scope of the 30 
invention. 

We claim: 

1. A method of training an analog neural network 
comprising plural neurons and synapses wherein said 
neurons are connected together by respective synapses, 35 
said neurons comprising respective activity states and 
adjustable neuron temperatures, said synapses compris- 
ing adjustable synapse weights, said method comprising 
the steps of: 

defining, relative to an error between activity states 40 
of a set of output neurons and a predetermined 
training vector, predetermined time-dependent 
behaviors of: (a) said activity states, (b) said neuron 
temperatures in accordance with a gradient de- 
scent of said error in temperature space and (c) said 45 
synapse weights in accordance with a gradient 
descent of said error in weight space, said behav- 
iors governed by (a) an activity state relaxation 
time, (b) a neuron temperature relaxation time and 
(c) a synapse weight relaxation time, respectively; 50 

continuously updating said neuron activity states, 
said neuron temperatures and said synapse weights 
of said analog neural network at respective rates 
corresponding to said relaxation times until said 
error is reduced below a predetermined threshold. 55 

2. The method of claim 1 wherein said activity state 
relaxation time is shorter than said neuron temperature 
relaxation time and shorter than said synapse weight 
relaxation time, whereby for a given set of values of said 
neuron temperatures and synapse weights, said neuron 60 
activity states equilibrate before changes in said set of 
values occur. 

3. The method of claim 2 wherein: 

said behavior of said activity states is an activity state 
differential equation for each neuron governing a 65 
time derivative of a corresponding activity state in 
terms of a product of a corresponding neuron tem- 
perature multiplied by a sum of products of corre- 
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sponding ones of said synapse weights and corre- 
sponding ones of said activity states; and 

said continuously updating comprises solving said 
activity state differential equation for said activity 
state repetitively at a rate corresponding to said 
activity state relaxation time. 

4. The method of claim 3 wherein: 

said behavior of said neuron temperatures is a temper- 
ature differential equation for each neuron defining 
a time derivative of a corresponding neuron tem- 
perature as a product of an exponential function of 
said neuron temperature and said error between 
activity states of the set of output neurons and the 
predetermined training vector transformed by a 
matrix of said synapse weights; and 

said continuously updating step further comprises 
solving said temperature differential equation for 
said neuron temperature repetitively at a rate cor- 
responding to said temperature relaxation time. 

5. The method of claim 4 wherein: 

said behavior of said synapse weights is a weight 
differential equation for each synapse defining a 
time derivative of a corresponding synapse weight 
as a product of an exponential function of the cor- 
responding neuron temperature and a sum of prod- 
ucts of said synapse weights; and 

said continuously updating step further comprises 
solving said weight differential equation for said 
synapse weight repetitively at a rate corresponding 
to said weight relaxation time. 

6. The method of claim 5 wherein said activity state 
differential equation, temperature differential equation, 
and weight differential equation are solved in an ascend- 
ing order associated with an order of ascending relax- 
ation times, such that a result of a solving of a preceding 
differential equation in the ascending order is employed 
in solving a next differential equation in the ascending 
order. 

7. The method of claim 1 wherein said neural net- 
work is a fully recurrent neural network. 

8. The method of claim 1 wherein said predetermined 
training vector corresponds to a classification problem 
and said temperature relaxation time is shorter than said 
weight relaxation time. 

9. The method of claim 1 wherein said predetermined 
training vector corresponds to a continuous mapping 
relation, and said weight relaxation time is shorter than 
said temperature relaxation time. 

10. Apparatus for training an analog neural network 
comprising plural neurons and synapses wherein said 
neurons are connected together by respective synapses, 
said neurons comprising respective activity states and 
adjustable neuron temperatures, said synapses compris- 
ing adjustable synapse weights, said apparatus compris- 
ing: 

means for defining and storing, relative to an error 
between activity states of a set of output neurons 
and a predetermined training vector, predeter- 
mined time-dependent behaviors of: (a) said activ- 
ity states, (b) said neuron temperatures in accor- 
dance with a gradient descent of said error in tem- 
perature space and (c) said synapse weights in ac- 
cordance with a gradient descent of said error in 
weight space, said behaviors governed by (a) an 
activity state relaxation time, (b) a neuron tempera- 
ture relaxation time and (c) a synapse weight relax- 
ation time, respectively; 
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means for continuously updating said neuron activity 
states, said neuron temperatures and said synapse 
weights of said analog neural network at respective 
rates corresponding to said relaxation times until 
said error is reduced below a predetermined 5 
threshold. 

11. The apparatus of claim 10 wherein said activity 
state relaxation time is shorter than said neuron temper- 
ature relaxation time and shorter than said synapse 
weight relaxation time, whereby for a given set of val- 10 
ues of said neuron temperatures and synapse weights, 
said neuron activity states equilibrate before changes in 
said set of values occur. 

12. The apparatus of claim 10 wherein: 

said behavior of said activity states is an activity state 15 
differential equation for each neuron governing a 
time derivative of a corresponding activity state in 
terms of a product of a corresponding neuron tem- 
perature multiplied by a sum of products of corre- 2Q 
sponding ones of said synapse weights and corre- 
sponding ones of said activity states; and 

said means for continuously updating comprises ’ 
means for solving said activity state differential 
equation for said activity state repetitively at a rate 25 
corresponding to said activity state relaxation time. 

13. The apparatus of claim 12 wherein: 

said behavior of said neuron temperatures is a temper- 
ature differential equation for each neuron govern- 
ing a time derivative of a corresponding neuron 30 
temperature as a product of an exponential func- 
tion of said neuron temperature and said error be- 
tween activity states of the set of output neurons 
and the predetermined training vector transformed 
by a matrix of said synapse weights; and 35 
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said means for continuously updating further com- 
prises means for solving said temperature differen- 
tial equation for said neuron temperature repeti- 
tively at a rate corresponding to said temperature 
relaxation time. 

14. The apparatus of claim 13 wherein: 

said behavior of said synapse weights is a weight 
differential equation for each synapse defining a 
time derivative of a corresponding synapse weight 
as a product of an exponential function of the cor- 
responding neuron temperature and a sum of prod- 
ucts of said synapse weights; and 

said means for continuously updating further com- 
prises means for solving said weight differential 
equation for said synapse weight repetitively at a 
rate corresponding to said weight relaxation time. 

15. The apparatus of claim 14 wherein said means for 
solving comprises means for solving said activity state 
differential equation, temperature differential equation, 
and weight differential equation in an ascending order 
associated with an order of ascending relaxation times, 
such that a result of a solving of a preceding differential 
equation in the ascending order is employed in solving 
a next differential equation in the ascending order. 

16. The apparatus of claim 15 wherein said predeter- 
mined training vector corresponds to a classification 
problem and said temperature relaxation time is shorter 
than said weight relaxation time. 

17. The apparatus of claim 15 wherein said predeter- 
mined training vector corresponds to a continuous map- 
ping relation, and said weight relaxation time is shorter 
than said temperature relaxation time. 

18. The method of claim 10 wherein said network is a 

fully recurrent neural network, 
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